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Abstract - The absence of time-reversal symmetry is a fundamental property of many nonlinear 
time series. Here, we propose a set of novel statistical tests for time series reversibility based on 
standard and horizontal visibility graphs. Specifically, we statistically compare the distributions 
of time-directed variants of the common graph-theoretical measures degree and local clustering 
coefficient. Unlike other tests for reversibility, our approach does not require constructing surro- 
gate data and can be applied to relatively short time series. We demonstrate its performance for 
realisations of paradigmatic model systems with known time-reversal properties as well as pickling 
up signatures of nonlinearity in some well-studied real- world neuro-physiological time series. 
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Introduction. — Nonlinear processes govern the dy- 
, <~| namics of many real- world systems. Therefore, a sophisti- 
Q-cated diagnostics and identification of such processes from 
observational data is a common problem in time series 
analysis important for model development. Consequently, 
in the last decades, testing for linearity of time series has 
^^been a question of paramount interest. A multiplicity of 
approaches has been developed for identifying signatures 
I of different types of nonlinearity as a necessary precondi- 
tion for the possible emergence of chaos [l] § 5.3]. Specif- 
I ically, a distinguished manifestation of nonlinearity is the 
asymmetry of certain statistical properties under time- 
i-H reversal (the foundation of the Q statistics introduced by 
Theiler et al. [2] in their seminal paper on surrogate data 
tests for linearity), since linearity of Gaussian processes 
directly implies time-reversibility [3}|5j (see [6j § 4] for fur- 
^jther details). Therefore, studying reversibility properties 
of time series is an important alternative to the direct 
quantitative assessment of nonlinearity [7] . 

Motivated by the enormous success of complex network 
theory in many fields of sciences M, in the last years 
several techniques for network-based time series analy- 
sis have been proposed [9}{l5]. As a particularly suc- 
cessful example, visibility graphs (VGs) and related meth- 
ods 10,11 are based on a concept originally introduced 



the concept for statistically investigating structural prop- 
erties of univariate (scalar-valued) time series. They could 
show that the degree distributions of the thus constructed 
VGs (see Methods) allow classifying time series accord- 
ing to the type of recorded dynamics and obey character- 
istic scaling in case of fractal or multifractal behaviour 
of the data under study 17 18 . These relationships 



make VGs promising candidates for studying observa- 
tional time series from various fields of research such as 
turbulence pi, finance fll^[M[2T] , physiology [l7p2"}|2"3" 



for analysing the mutual visibility relationships in two- 
dimensional landscapes 16 . Lacasa et al. [To] adopted 



or geosciences 24-28 

In [29], Lacasa et al. demonstrated that horizontal vis- 
ibility graphs (HVGs) [IT], an algorithmic variant of VGs 
(see Methods), allow discriminating between reversible 
and irreversible time series. Based on a time-directed ver- 
sion of HVGs, they could show that irreversible dynamics 
results in an asymmetry between the probability distribu- 
tions of the numbers of incoming and outgoing edges of all 
network vertices, which can be detected by means of the 
associated Kullback-Leibler divergence. In this work, we 
extend this idea and provide a set of rigorous statistical 
tests for time series reversibiliby, which can be formulated 
based on both standard and horizontal VGs. Specifically, 
we demonstrate that for VGs and HVGs, degrees as well as 
local clustering coefficients can be decomposed into con- 
tributions from past and future observations, which al- 
lows studying some of the time series' statistical proper- 
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ties under time-reversal. We find statistically significant 
deviations between the distributions of time-ordered ver- 
tex properties for nonlinear systems for which the absence 
of time-reversal symmetry is known, but not for linear 
systems. As a real-world application, the power of the 
proposed approach for studying the presence of nonlinear- 
ity in neuro-physiological time series (EEG recordings) is 
demonstrated. 

Methods. — Visibility graphs are based on a simple 
mapping from the time series to the network domain ex- 
ploiting the local convexity characteristics of scalar-valued 
time series {x(ti)}fL 1 . Specifically, each observation x(t,) 
is assigned a vertex i of a complex network, which is 
uniquely defined by the time of observation tj. Two ver- 
tices i and j are linked by an edge (i,j) iff the condi- 
tion [To 

ti — tf. 



u 



u 



(i) 



applies for all vertices k with U < tk < tj . This is, the ad- 
jacency matrix (Aij) describing the VG as a simple undi- 
rected and unweighted network reads 
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(2) 

where O(-) is the Heaviside function. 

Horizontal VGs have been recently introduced as a sim- 
plified version of this algorithm For a given time se- 
ries, the vertex sets of VG and HVG are the same, whereas 
the edge set of the HVG maps the mutual horizontal visi- 
bility of two observations Xi and Xj, i.e., there is an edge 
iff Xk < mm(xi, Xj) for all k with f$ < tj, < tj, so 

that 



A (HVG) _ A (HVG) 



Yl 9 (xj - x fc ) 6 (xj - x k ) ■ (3) 



k—i+1 



VG and HVG capture essentially the same properties of 
the system under study (e.g., regarding fractal properties 
of a time series), since the HVG is a subgraph of the VG 
with the same vertex set, but possessing only a subset of 
the VG's edges. Note that the VG is invariant under a 
superposition of linear trends, whereas the HVG is not. 

Time- directed vertex properties. According to their 
definition, the direction of time is intrinsically interwoven 
with the structure of VGs and HVGs. Therefore, it is 
useful to define a set of novel statistical network quantifiers 
taking this fact explicitly into account. At this point, we 
restrict ourselves to two simple vertex characteristics: 

(i) On the one hand, the degree of a vertex in a complex 
network measures the number of edges incident to this 
vertex. Here, for a (H)VG we decompose this quantity for 
a vertex corresponding to a measurement at time tj into 
contributions due to other vertices in the past and future 



of U, respectively: 



k% 



where 



and 



K — E Ai - 

j<i 

fc? = 2J A t 

j>i 



ki , 



(4) 



(5) 
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will be referred to as the retarded degree and advanced 
degree, respectively, in the following. Note that k\ and kf 
correspond to the respective in- and out-degrees of time- 
directed (H)VGs as recently defined in [29] . 

(ii) On the other hand, the local clustering coefficient 



^ ' A-ijAjkAki 



(7) 



is another vertex property of higher order characterising 
the neighbourhood structure of vertex i [8]. Here, for 
studying the connectivity due to past and future obser- 
vations separately, we define the retarded local clustering 
coefficient 



AijAjkA 
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ki 



(8) 



as well as the advanced local clustering coefficient 

-l 

" 1«. (9) 



Cf = 



^ AijAjkAk 

j,k>i 



Hence, both quantities measure the probability that two 
neighbours in the past (future) of observation i are mu- 
tually visible themselves. Note that the decomposition of 
Ci into retarded and advanced contributions is not as sim- 
ple as for the degree and requires consideration of degree- 
related weight factors and an additional term combining 
contributions from the past and future of a given vertex: 



Ci = 



(10) 



The latter contribution shall, however, not be further dis- 
cussed at this point. 

Testing for time-reversibility. Time-reversibility of 
a stationary stochastic process or time series {x{\ 
requires that for arbitrary n and m, the tuples 
{x n , x n -\.\, • ■ • , Xn-j-m) and (x n _|_ m , x n _|_ m _i, . . . , x n ) have 
the same joint probability distribution pi. Instead of test- 
ing this condition explicitly (which is practically unfeasi- 
ble in most situations due to the necessity of estimating 
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high-dimensional probability distribution functions from 
a limited amount of data), it is a common simplification 
to compare the distributions of certain statistical charac- 
teristics obtained from both vectors. VG based methods 
appear particularly suited for this purpose. Specifically, 
in the following we will utilise the frequency distributions 
P(k r,a ) and P(C r,a ) of retarded and advanced degrees and 
local clustering coefficients as representatives for the sta- 
tistical properties of the time series when viewed forward 
and backward in time. 

In the case of time-reversibility, we conjecture that 
both degree sequences {fc[' a } (or local clustering sequences 
{C^ a }) should be drawn from the same probability distri- 
bution, because the visibility structure towards the past 
and future of each observation has to be statistically equiv- 
alent. In turn, for an irreversible (i.e., nonlinear) process, 
we expect to find statistically significant deviations be- 
tween the probability distributions of retarded and ad- 
vanced vertex characteristics. 

As an alternative to the Kullback-Leibler distance be- 
tween the empirically observed distribution functions used 
by Lacasa et al. 
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we propose utilising some standard 
statistics for testing the homogeneity of the distribution 
of random variables between two independent samples. In 
this framework, rejecting the null hypothesis that {k^' a } 
({Cj' a }) are drawn from the same probability distribution, 
respectively, is equivalent to rejecting the null hypothesis 
that the time series under investigation is reversible. Here, 
we consider the Kolmogorov-Smirnov (KS) test as a suit- 
able and generally applicable means to testing this null 
hypothesis, which directly provides p-values for rejecting 
the reversibility hypothesis. Hence, a small p-value of the 
KS test statistic (e.g., p < 0.05) implies that the time se- 
ries has likely been generated by an irreversible stochastic 
process or dynamical system. Even more, the p- values are 
universal and distribution-free in the limit of N — > oo, 
which implies that we do not need to construct surrogate 
time series for obtaining critical values of our test statistics 
as in other reversibility tests. 

Model systems. — Let us illustrate the potentials 
of the proposed method for two simple model systems: 
(a) a linear-stochastic first-order autoregressive (AR(1)) 
process 

x t = axt-i + £ t (11) 

with a = 0.5 and the additive noise term £ t taken as inde- 
pendent realizations of a Gaussian random variable with 
zero mean and unit variance, and (b) the ^-component of 
the nonlinear-deterministic Henon map 



x t = A 



By 



t-i, 



yt = x t -i 



(12) 



with A = 1.4 and B = 0.3. In both cases, we generate 
ensembles of independent realisations with random initial 
conditions and discard the first 1,000 points of each time 
series to avoid possible transients. 




Fig. 1: (Colour online) Distributions of retarded (black) and 
advanced (red) (A,C) degrees fe£'° and (B,D) local cluster- 
ing coefficients C[' a of the standard VG for two simple model 
systems: (A,B) AR(1) process and (C,D) Henon map [x- 
component). Time series of length TV = 500 have been used for 
estimating the probability density functions (PDF) p{k r ' a ) and 
p(C r ' a ) with a kernel density estimator. The mean (solid lines) 
and standard deviation (dashed lines) of the PDFs have been 
computed based on an ensemble of M = 1,000 realizations 
with random initial conditions for both model systems. 



As expected, for the linear (reversible) AR(1) pro- 
cess, the empirical distributions of retarded/advanced de- 
grees and local clustering coefficients collapse onto each 
other (Fig. [I]^,B). Consequently, the null hypothesis of 
reversibility is never rejected by the test based on the de- 
gree (Fig. and only rarely rejected by the clustering- 
based test well below the expected false rejection rate of 
5% (Fig. [2j3) . Similar results are obtained for AR1 realiza- 
tions very close to Brownian motion with a = 0.9, 0.99. In 
contrast, for the irreversible Henon map the distributions 
of retarded and advanced VG measures appear distinct 
already by visual inspection (Fig. [Tp,D). In accordance 
with this observation, the null hypothesis of reversibility 
is nearly always (degree, Fig. [2p) or always (local clus- 
tering coefficient, Fig. [2jD) rejected. Consistently, an even 
higher rejection rate is found for the highly nonlinear and 
infinite dimensional Mackey-Glass system in periodic and 
hyperchaotic regimes 
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All results are qualitatively in- 
dependent of the chosen network construction algorithm 
(VG or HVG). 

To further evaluate the performance of the tests for 
varying sample size N, we consider the fraction q(N) of 
time series from an ensemble of realisations for which the 
null hypothesis of reversibility can be rejected (Fig. [3]). 
For the AR(1) process, it is known that the null hypoth- 
esis is true. Hence, q(N) estimates the probability of type 
I errors (incorrect rejections of true null hypothesis) for 
both tests (Fig.[3jA.,C). To put it differently, l-q(N) mea- 
sures the specificity of the test. Notably, for the standard 
VG, q(N) is always zero for the degree-based test, while it 
fluctuates clearly below the expected type I error rate of 
0.05 for the test based on local clustering (Fig. [3K). For 
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Fig. 2: (Colour online) Frequency distributions of p-values 
of the KS statistic for comparing the distributions of re- 
tarded/advanced (A,C) degree k\' a and (B,D) local cluster- 
ing coefficient C[' a of standard VGs from an ensemble of 
M = 1,000 realisations of model system time series of length 
N = 500: (A,B) AR(1) process, (C,D) Henon map. Vertical 
red lines indicate the typical significance level of 0.05. 



the HVG based tests, q(N) takes slightly higher values, 
which, however, remain below the acceptable error level 
(Fig. §3). 

In contrast to the linear AR(1) process, for the irre- 
versible Henon map the null hypothesis of reversibility is 
known to be false. Therefore, q(N) measures the power 
of the test, whereas 1 — q(N) gives the probability of type 
II errors (failure to reject a false null hypothesis), i.e., 
its sensitivity (Fig. [3|3,D). Interestingly, the power of the 
clustering-based test increases markedly earlier than that 
of the degree-based test. For the standard VG based test 
(Fig. [3j3) , the former reaches q(N) ~ 1 already around 
N = 200, whereas the latter requires twice as many sam- 
ples to arrive at the same power. Notably, the convergence 
towards q(N) = 1 is much faster for the HVG algorithm 
(Fig.|3p), leading to a perfect hit rate at N = 100 and 200 
for local clustering coefficients and degrees, respectively. 

Receiver operating characteristics (ROC curves) enable 
a more systematic comparison between the VG- and HVG- 
based tests. For varying the critical p-value of the KS 
statistic for rejecting the reversibility hypothesis, the re- 
jection probability for AR1 (false positive rate) and Henon 
(true positive rate) time series is plotted. Given a fixed 
true positive rate, the VG-based tests typically have a 
higher error probability (false positive rate) than those 
utilising HVGs (Fig. [§Y,B). As the length N of the in- 
dividual records increases, VG-based tests display better 
convergence properties towards the ideal ROC curve with 
area under ROC curve AU C = 1, whereas the HVG-based 
tests show substantial fluctuations even for relatively long 
time series (Fig. |4]C,D). We attribute the faster conver- 
gence, but larger residual error probability of the HVG- 
based tests to the stronger constraints imposed during net- 
work construction in comparison with the standard VG al- 
gorithm. Since the VG involves more edges than the HVG, 




100 200 300 
N 



Fig. 3: Fraction q(N) of model time series of length N from 
ensembles of M = 1, 000 realisations for which the null hypoth- 
esis of reversibility was rejected at the 0.05 significance level: 
(A,C) AR(1) process, (B,D) Henon map. The results have been 
obtained using (A,B) VGs and (C,D) HVGs using tests based 
on retarded/advanced degrees (solid lines) and local clustering 
coefficients (dash-dotted lines), respectively. The null hypoth- 
esis is never rejected for the VG-based test using time-directed 
degrees applied to the AR(1) time series (A). 



the former is more robust and less resilient to statistical 
fluctuations, but a larger number of vertices is necessary 
for identifying irreversible behaviour in the data under 
study. Notably, particularly for HVGs, the local cluster- 
ing coefficient-based tests systematically perform slightly 
better than those using degree (Fig. [4jZ> ) . 

Real-world example. — To further demonstrate the 
potentials of VG-based reversibility tests for real-world 
data, we apply them to continuous electroencephalogram 
(EEG) recordings for healthy and epileptic patients that 
were previously analysed by Andrzejak et al. 31 . The 
data consist of five sets of M = 100 representative time 
series segments of length N = 4, 096 comprising record- 
ings of brain activity for different patient groups and 
recording regions (Table [TJ. To look for traces of low- 



dimensional nonlinear dynamical behaviour in the data, 



Andrzejak et al. 31 used the nonlinear prediction error 
P and the effective correlation dimension I?2,e// as statis- 
tics to test the null hypothesis Hq™ that the time series 
are compatible with a stationary linear-stochastic Gaus- 
sian process. 

Since irreversibility is a signature of nonlinear dynamics, 
we expect the results of our VG-based tests to be consis- 
tent with those of 



31 



Indeed, the rate of rejections q 
of the null hypothesis of reversibility increases markedly 
from hardly any rejections for set A, where H l ™ could 
not be rejected by 



31 



to q w 1 for set E, where Hq 
was rejected using both test statistics (Table [l]). Hence, 
consistently with the results of 31 , the VG-based tests in- 
dicate reversible linear dynamics for healthy subjects (set 
A, Fig. [5]4., B) and irreversible nonlinear dynamics dur- 
ing epileptic seizures (set E, Fig. [5]C,D). The other data 
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Table 1: Results of VG-based tests for reversibility of EEG time series put into context with the results from 31 based on the 
nonlinear prediction error P and the effective correlation dimension Z)2,e// as statistics to test the null hypothesis Hq" that the 
time series are compatible with a stationary linear-stochastic Gaussian process, (qc) denotes the fraction of time series from 
a set of M — 100 segments for which the null hypothesis of reversibility was rejected by the proposed VG-based tests using the 
retarded/advanced degrees (local clustering coefficients) (see also Fig. |EJ. 
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Fig. 4: (A,B) ROC curves for the (A,C) VG- and (B,D) HVG- 
based tests for reversibility comparing the rejection rates for 
each M = 10, 000 realisations of AR1 (false positive rate) 
and Henon time series (true positive rate) with varying crit- 
ical p-value of the KS statistic (N = 100). (C,D) Area un- 
der the ROC curve (AUC) characterising the discriminative 
performance of all tests depending on time series length N. 
Solid and dash-dotted lines indicate degree- and local cluster- 
ing coefficient-based tests, respectively. 



sets (B-D) are identified as intermediate cases with re- 
spect to the proposed tests, suggesting the presence of 
time-reversal asymmetry and, hence, nonlinear dynamics. 
In summary, our tests perform consistently with those 



applied by 31 which are arguably more complicated both 
technically and conceptually. Furthermore, the results of 
the VG-based tests are consistent with those obtained us- 
ing the classical Q statistics of [2| p. 84] together with 
standard and amplitude adjusted Fourier surrogate time 
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Fig. 5: Frequency distributions of p- values of the KS test for 
comparing the distributions of retarded/advanced (A,C) de- 
gree k\' a and (B,D) local clustering coefficient C[' a of standard 
VGs from a set of M — 100 EEG time series segments of length 
N — 4, 096. Recordings originate from (A,B) healthy subjects 
with eyes open (data set A) and (C,D) epileptic patients dur- 
ing seizure (data set E). Vertical red lines indicate the chosen 
significance level of 0.05. 



32 , which serve as a benchmark test for time series 



series 

reversibility. 

Conclusions. — The statistical tests for reversibility 
of scalar- valued time series proposed in this work highlight 
the important potentials of complex network-based ap- 
proaches for time series analysis problems. Utilising stan- 
dard as well as horizontal VGs for discriminating between 
the properties of observed data forwards and backwards in 
time has at least two important advantages in comparison 
with established tests for time series reversibility: 

(i) The reversibility properties are examined without 
the necessity of constructing surrogate data. Hence, the 
proposed approach saves considerable computational costs 
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in comparison with other state-of-the-art statistical ap- 
proaches developed for the same task and, more impor- 
tantly, avoids the problem of selecting a particular type of 
surrogates. Specifically, utilising the KS test statistic or a 
comparable two-sample test for the homogeneity (equal- 
ity) of the underlying probability distribution functions 
directly supplies a p- value for the associated null hypothe- 
sis that the considered properties of the data forward and 
backward in time are statistically indistinguishable. 

(ii) The proposed approach can be directly applied to 
data with non-uniform sampling, which frequently occur 
in various fields of sciences (e.g., palaeoclimate 26 or as- 
trophysics), or marked point processes (e.g., earthquake 
catalogues [27] ). We emphasise that for the latter classes 
of data, constructing surrogates for linearity tests in the 
most common way by means of Fourier-based techniques 
is a challenging task. Hence, using VG-based methods for 
testing linearity of observational records is a powerful and 
widely applicable approach. 

While this letter highlights the potentials of the pro- 
posed approach, there are methodological questions such 
as the impacts of sampling or observational noise as well 
as a systematic comparison to existing methods for testing 
time series reversibility that need to be systematically ad- 
dressed in future research. Furthermore, VG-based meth- 
ods are generally faced with problems such as boundary 



effects and the ambiguous treatment of missing data 26 
which call for further investigations. 

Finally, there is a variety of other measures characteris- 
ing complex networks on the local (vertex/edge) as well as 
global scale that could be used for similar purposes as the 
degree and local clustering coefficient studied in this work. 
However, the well-established path-based network charac- 
teristics such as closeness, betweenness, or average path 
length cannot be easily decomposed into retarded and ad- 
vanced contributions. Hence, the approach followed here 
is mainly restricted to neighbourhood-based network mea- 
sures like degree, local and global clustering coefficient, or 
network transitivity. As a possible solution, instead of de- 
composing the network properties, the whole edge set of 
a (H)VG could be divided into two disjoint subsets that 
correspond to visibility connections forwards and back- 
wards in time, as originally proposed by Lacasa et al. [29] . 
For these directed (forward and backward) (H)VGs, also 
the path-based measures can be computed separately and 
might provide valuable information. However, path-based 
measures of (H)VGs are known to be strongly influenced 
by boundary effects [26], so that they could possibly lose 
their discriminative power for reversibility tests. 
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